Linear and nonlinear speech feature analysis for stress classification
نویسندگان
چکیده
There are many stressful environments which deteriorate the performance of speech recognition systems. Examples include aircraft cockpits, 911 emergency telephone response, high workload task stress, or emotional situations. To address this, we investigate a number of linear and nonlinear features and processing methods for stressed speech classi cation. The linear features include properties of pitch, duration, intensity, glottal source, and the vocal tract spectrum. Nonlinear processing is based on our newly proposed Teager Energy Operator (TEO) speech feature which incorporates frequency domain critical band lters and properties of the resulting TEO autocorrelation envelope. In this study, we employ a Bayesian hypothesis testing approach and a hidden Markov model (HMM) processor as classi cation methods. Evaluations focused on speech under loud, angry, and the Lombard e ect1 from the SUSAS database. Results using receiver operating characteristic (ROC) curves and EER (equal error rate) based detection show that pitch is the best of the ve linear features for stress classi cation; while the new nonlinear TEO-based feature outperforms the best linear feature by +5.2%, with a reduction in classi cation rate variability from 8.66 to 3.90.
منابع مشابه
Improving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms
One of the important issues in speech emotion recognizing is selecting of appropriate feature sets in order to improve the detection rate and classification accuracy. In last studies researchers tried to select the appropriate features for classification by using the selecting and reducing the space of features methods, such as the Fisher and PCA. In this research, a hybrid evolutionary algorit...
متن کاملMethods for stress classification: nonlinear TEO and linear speech based features
Speech production variations due to perceptually induced stress contribute signiicantly to reduced speech processing performance. One approach that can improve the robustness of speech processing (e.g., recognition) algorithms against stress is to formulate an objective classiication of speaker stress based upon the acoustic speech signal. In this paper , an overview of recent methods for stres...
متن کاملNonlinear feature based classification of speech under stress
Studies have shown that variability introduced by stress or emotion can severely reduce speech recognition accuracy. Techniques for detecting or assessing the presence of stress could help improve the robustness of speech recognition systems. Although some acoustic variables derived from linear speech production theory have been investigated as indicators of stress, they are not always consiste...
متن کاملHyperspectral Image Classification Based on the Fusion of the Features Generated by Sparse Representation Methods, Linear and Non-linear Transformations
The ability of recording the high resolution spectral signature of earth surface would be the most important feature of hyperspectral sensors. On the other hand, classification of hyperspectral imagery is known as one of the methods to extracting information from these remote sensing data sources. Despite the high potential of hyperspectral images in the information content point of view, there...
متن کاملClassification of speech under stress based on features derived from the nonlinear Teager energy operator
Studies have shown that distortion introduced by stress or emotion can severely reduce speech recognition accuracy. Techniques for detecting or assessing the presence of stress could help neutralize stressed speech and improve robust-ness of speech recognition systems. Although some acoustic variables derived from linear speech production theory have been investigated as indicators of stress, t...
متن کامل